Leveraging Community-Built Knowledge for Type Coercion in Question Answering
Abstract
Watson, the winner of the Jeopardy! challenge, is a state-of-the-art open-domain Question Answering system. It tackles the fundamental issue of answer typing with a novel type coercion (TyCor) framework: candidate answers are initially produced without considering type information, and subsequent stages check whether each candidate can be coerced into the expected answer type. In this paper, we provide a high-level overview of the TyCor framework and discuss how it is integrated into Watson. We focus on and evaluate three TyCor components that leverage community-built semi-structured and structured knowledge resources: DBpedia (in conjunction with the YAGO ontology), Wikipedia Categories, and Lists. These resources complement each other well in terms of precision and granularity of type information and, through links to Wikipedia, provide coverage for a large set of instances.
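The generate-then-coerce idea described in the abstract can be illustrated with a minimal sketch. This is not Watson's implementation: the lookup tables, the substring matching, the weights, and the rule of taking the maximum score are all assumptions made for illustration, and every function name here is hypothetical.

```python
from typing import Callable, Dict, List, Set, Tuple

# Toy stand-ins for the three community-built resources named in the abstract.
# The entries below are illustrative assumptions, not data from those resources.
TOY_ONTOLOGY_TYPES: Dict[str, Set[str]] = {
    "Paris": {"city", "capital"},
    "Seine": {"river"},
}
TOY_CATEGORIES: Dict[str, Set[str]] = {
    "Paris": {"capitals in europe"},
    "Victor Hugo": {"french novelists"},
}
TOY_LISTS: Dict[str, Set[str]] = {
    "Seine": {"rivers of france"},
}

Scorer = Callable[[str, str], float]


def ontology_scorer(candidate: str, expected_type: str) -> float:
    """Exact type lookup: precise, but only for instances the ontology covers."""
    return 1.0 if expected_type in TOY_ONTOLOGY_TYPES.get(candidate, set()) else 0.0


def make_label_scorer(table: Dict[str, Set[str]], weight: float) -> Scorer:
    """Crude substring match of the expected type against free-text labels."""
    def score(candidate: str, expected_type: str) -> float:
        labels = table.get(candidate, set())
        return weight if any(expected_type in label for label in labels) else 0.0
    return score


# Hypothetical weights: label matches are trusted less than ontology types.
SCORERS: List[Scorer] = [
    ontology_scorer,
    make_label_scorer(TOY_CATEGORIES, 0.5),
    make_label_scorer(TOY_LISTS, 0.5),
]


def tycor_score(candidate: str, expected_type: str) -> float:
    """Combine the component scores; taking the maximum is an assumption."""
    return max(scorer(candidate, expected_type) for scorer in SCORERS)


def rank_candidates(candidates: List[str], expected_type: str) -> List[Tuple[str, float]]:
    """Candidates are generated without type information, then scored and
    reordered by how well they coerce to the expected type (none are dropped)."""
    scored = [(c, tycor_score(c, expected_type)) for c in candidates]
    return sorted(scored, key=lambda pair: pair[1], reverse=True)


if __name__ == "__main__":
    print(rank_candidates(["Victor Hugo", "Seine", "Paris"], "river"))
```

In this sketch, typing is applied as a score after candidate generation rather than as a filter before it, so a candidate missing from every resource is ranked lower but not discarded.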
Related resources
Mining Knowledge from Large Corpora for Type Coercion in Question Answering
A fundamental issue in natural language processing and knowledge acquisition is recognizing whether a given string is an instance of a given type. In this paper, we present a solution to the typing problem by mining knowledge from an unstructured text corpus, and apply it in the context of type coercion in question answering. We describe a new generate-and-type framework, called TyCor (short fo...
Typing candidate answers using type coercion
J. W. Murdock, A. Kalyanpur, C. Welty, J. Fan, D. A. Ferrucci, D. C. Gondek, L. Zhang, H. Kanayama. Many questions explicitly indicate the type of answer required. One popular approach to answering those questions is to develop recognizers to identify instances of common answer types (e.g., countries, animals, and food) and consider only answers on those lists. Such a strategy is po...
Hi, how can I help you?: Automating enterprise IT support help desks
Question answering is one of the primary challenges of natural language understanding. In realizing such a system, providing complex long answers to questions is a challenging task as opposed to factoid answering as the former needs context disambiguation. The different methods explored in the literature can be broadly classified into three categories namely: 1) classification based, 2) knowled...
Question Difficulty Estimation in Community Question Answering Services
In this paper, we address the problem of estimating question difficulty in community question answering services. We propose a competition-based model for estimating question difficulty by leveraging pairwise comparisons between questions and users. Our experimental results show that our model significantly outperforms a PageRank-based approach. Most importantly, our analysis shows that the tex...
Presenting a Religious Question Answering Corpus in the Persian Language
Question answering is a field of natural language processing and information retrieval that has attracted researchers' attention in recent decades. Growing interest in this field has created a need for appropriate data sources. Most research on developing question answering corpora has so far been done for English, but in other languages such as Persian, the lack of these co...
Publication date: 2011